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ROTA BAXTER ALGEBRA 

THE COMBINATORIAL STRUCTURE 
OF INTEGRAL CALCULUS 

KURUSCH EBRAHIMI-FARD AND FREDERIC PATRAS 



Abstract. Gian-Carlo Rota suggested in one of his last articles the problem of developing a theory 
around the notion of integration algebras, complementary to the already existing theory of differ- 
ential algebras. This idea was mainly motivated by Rota's deep appreciation for Kuo-Tsai Chen's 
seminal work on iterated integrals. As a starting point for such a theory of integration algebras 
£f~) • Rota proposed to consider a particular operator identity first introduced by the mathematician Glen 

Baxter. Later it was coined Rota-Baxter identity. In this article we briefly recall basic properties 
of Rota-Baxter algebras, and present a concise review of recent work with a particular emphasis of 
noncommutative aspects. 



Rota-Baxter algebra; Spitzer identity; Bohnenblust-Spitzer identity, ordinary differential 
equations; Hopf algebra; pre-Lie algebra; (quasi-)shuffle algebra. 
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1. Introduction 

During the period 1960-1972 P. Cartier, G.C. Rota and collaborators set out to develop a general 
theory of the algebraic and combinatorial structures underlying integral calculus. Nowadays it has 
been subsumed by what is better known as the theory of Rota-Baxter algebras. In spite of the 
fact that those algebras need not to be commutative, a large part of the principal results have been 
described in the context of commutative function algebras. Interest in the general, i.e., noncommu- 
tative, case, which includes among several other areas the integral and finite difference calculus for 
operator algebras, started around 2000. Originally motivated by problems from theoretical physics, 
more precisely, the renormalization program in quantum field theory (that is, the process by which 
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2 KURUSCH EBRAHIMI-FARD AND FREDERIC PATRAS 

divergent integrals can be made finite), the domain of applications of the noncommutative theory 
has grown steadily since then. 

The reader will find in [32} [371 US] good surveys on the foundations of the theory of Rota-Baxter 
algebras. Neither do we strive for an exhaustive presentation of the field nor do we seek to include 
a complete list of references. The latter restriction is mainly due to the extensive developments 
that have taken place during the last ten years, which resulted in many ramifications. The present 
survey will instead try to offer insights on recent progress made in a precise direction -maybe 
the most significative one from the point of view of integral and differential calculus- namely the 
extension to the noncommutative setting of the fundamental results of the commutative theory, 
such as the Bohnenblust-Spitzer identity or the construction of the standard Rota-Baxter algebra 
(a presentation due to Rota of the free Rota-Baxter algebra in terms of symmetric functions). 

The article commences by recalling the physical as well as the probabilistic origins of the theory 
and presents the main classical examples of Rota-Baxter algebras that stem mostly (but not exclu- 
sively) from analysis. The classical Cartier-Rota (commutative) theory is then presented briefly. 
Links with symmetric functions, shuffle algebras and fundamental (Spitzer-type) identities are put 
forward. The theory is presented with a view toward its noncommutative extension; this approach 
is neither standard nor, in some sense, the most natural one in the commutative setting, but it has 
the advantage of making transparent the ideas underlying the transition from the commutative to 
the noncommutative realm. 

We enter then the core of the subject, i.e., the theory of noncommutative Rota-Baxter algebras 
by mainly insisting on three ideas. First, the existence of noncommutative analogs of most of the 
fundamental commutative constructions. This idea is illustrated by the close relationship existing 
between the structure of free Rota-Baxter algebras and noncommutative symmetric functions }30j . 
Then, the existence of noncommutative analogs of most of the fundamental commutative formulas. 
Eventually, we would like to emphasize that calculus in Rota-Baxter algebras is "generic" for several 
theories, very much as tensor calculus is generic for the theory of iterated integrals and Lie algebras 
(in a sense that can be made rigorous as in j43j : technically, formulas in the tensor algebra can be 
shown to hold in arbitrary enveloping algebras of graded Lie algebras). Theories to which such a 
remark applies in the noncommutative framework include the one of iterated integrals and sums of 
operator valued functions (corresponding respectively to noncommutative shuffle and quasi-shuffle 
products) as well as of derivations and differential operators (corresponding to pre-Lie products). 

Concretely, all this means that it is often convenient (at least according to our experience) to 
work within the setting of Rota-Baxter algebras when it comes to looking for universal formulas 
solving problems in integral calculus, difference calculus, dynamical systems, and so on. 

Acknowledgements: The first author is supported by a Ramon y Cajal research grant from the 
Spanish government. The work was supported by the project MTM201 1-23050 of the Ministerio 
de Economia y Competitividad, Spain. 



2. Rota-Baxter Algebras 

"Whereas algebraists have devoted a lot of attention to derivations, the algebraic 
theory of the indefinite integral has been strongly neglected. The shuffle identities 
are only the tip of an iceberg of algebra and combinatorics of the indefinite integral 
operator which remains unexplored". (G.C. Rota |47| ) 
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2.1. Origin and Integral Calculus. The Rota-Baxter identity and, together with it, the very 
notion of Rota-Baxter algebra first appeared in 1960 in the work of Baxteiu [6j. Baxter's paper 
originated from a result in probability theory due to Spitzer in 1956 [50] on which we will comment 
later. Various proofs of the so-called Spitzer identity had been obtained before, but Baxter's 
approach succeeded to unveil in terms of the aforementioned identity the underlying algebraic and 
combinatorial structures. 

About ten years later, Rota [21 HSJ H6] succeeded to deduce Spitzer's identity using classical re- 
sults from the theory of symmetric functions -indeed, he showed that Spitzer's identity is equivalent 
to the Waring identity. Rota and his school advocated algebraic combinatorics, both mathemat- 
ically and on epistemological grounds -an attitude that can be recognized in his introduction to 

m- 

"The spectacular results in the fluctuation theory of sums of independent random 
variables [...] have gradually led to the realization that the nature of the problem, 
as well as that of the methods of solution, is algebraic and combinatorial [...]. It is 
the present purpose to carry this algebraization to the limit: the result we present 
amounts to a solution of the word problem for Baxter algebras" . 

Another essential component of the mathematical foundation of the theory can be traced back to 
P. Cartier [10]. Similarly to Rota, Cartier considered the word problem for Rota-Baxter algebras 
(the problem amounts to constructing explicitly a basis of the free Rota-Baxter algebra), but he 
suggested a rather different approach, that lead him to the notion of quasi-shufne product. Much 
later this product were formalized by M. Hoffman in [31]. It underlies for example the combinatorial 
structure of multiple zeta values (MZVs) [llj . 

In itself, the definition of Rota-Baxter algebra is fairly elementary. Indeed, it just consists of 
an associative algebra A (say unital, and from now on over a field k of characteristic 0), equipped 
with a linear operator R : A — > A satisfying the Rota-Baxter relation: 

(1) R(x)R(y) = R(R(x)y + xR(y) + 9xy). 

The parameter 9 is a fixed element of k, which we call the weight. The sum of the three terms 
on which the map R acts on the right hand side of the equation defines a new product an A. We 
denote it *g and call it double product (of weight 9) : 

(2) x * e y := R(x)y + xR(y) +9xy. 

Observe that ([1]) implies that R(x *g y) = R(x)R(y). One checks easily that this product is 
associative and that R is still a Rota-Baxter operator for *g (that is, the Rota-Baxter relation still 
holds when one replaces the initial product on A by the double product *#). We will later see how 
useful this new product may be. 

Let us first consider the case when 9 = and A is commutative. The Rota-Baxter relation is 
then simply the usual integration by parts relation: take for example A to be the algebra C°°(R) 
of smooth functions on the real line and set R(f)(t) := L f(x)dx. For two functions /, g we obtain 
the weight zero Rota-Baxter identity: 

(3) R(f)R(g) = R{R(f)g + fR(g)). 

Even in this somewhat degenerate case the theory is not without interest since it embraces among 
others Chen's iterated integral calculus, Schiitzenberger's theory of shuffle algebras [48] as well as a 
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large part of Fliess's approach to control theory [29] (the one of chronological algebras in the sense 
of |34j). Indeed, from ([3]) follows in the weight zero case that: 

R(hR(f2 ■ ■ ■ R(fn) ■ ■ ■ ))R(giR(92 ■ ■ ■ R{g m ) ■■■)) 

(4) =J2R(h 1 R(h 2 ---R(h n+m ) ■■■)), 

where the sum runs over all the shuffles of the sequences /i, . . . , f n and g±, . . . , g m , that is, those 
sequences h± , . . . , h m+n made of the fi and the gj where the partial order of the fi and the gj is 
preserved (for example, xiyiX2y2y3X3X4 or x\X2y\y2Xzx\yz are shuffles of the words x\X2X^x^ and 
2/12/22/3, but not x\x^X2y\y2Xzy , i^ since £4 appears on the left of x<2 and X3). When R is the integral 
operator, we recognize in R(f\R(f2 ■ ■ ■ R{f n ) ■ ■ •)) the iterated integral of functions /1, . . . , / n , and 
dU) yields the well known -shuffle- product of two such iterated integrals. 

Equivalently, from a more abstract point of view, the (weight zero) double product *o = * splits 
into two components (known as the Eilenberg-MacLane and Schiitzenberger half-shuffles): 

f*g = ft9 + fig 

where 

(5) f^g:=fR(g), fig:=R(f)g. 

The products f and j. satisfy the axiomatic characterization of shuffle algebras that appeared 
independently in algebraic topology and in Lie theory in the 1950's [26ll4*8]: 

(6) aib = bta, (a t b) t c = a f (b t c + c f 6). 

The reader may check by herself that these relations are enough to insure that the product defined 
by the sum of compositions 7 + 4, is associative as well as commutative [38] . 

These ideas are fairly universal and show up in many different contexts. Let X be an alphabet 
and let us write Sh(X) for the algebra freely generated by X, and the products t, I modulo the 
relations ([6]); using the interpretation [5] of the two products f, J,, Sh(X) embeds as a subalgebra in 
the free Rota-Baxter algebra over X. In the theory of free Lie algebras as well as for MZVs, and 
actually in most application domains where the notion is relevant, Sh(X) is called the free shuffle 
algebra over X or the tensor algebra over X equipped with the shuffle product |43j . In control 
theory it is (sometimes) called the free chronological algebra over X (in the sense of Kawski), in 
the theory of operads, it is called the free Zinbiel algebra over X [38], a tribute to Cuvier's dual 
notion of Leibniz algebras [181 [19] . 

So far we have presented the simple weight zero case and related commutative algebraic struc- 
tures. The extension to the non-zero weight case as well as the generalization to noncommutative 
algebras are somewhat more involved. In the following we will outline the general theory. Saying 
this, we should remark that for the sake of space we refrain from giving an exhaustive account. How- 
ever, let us emphasize the underlying idea. Namely, Rota-Baxter algebra calculus is representative 
for a whole class of theories, including iterated integrals of scalar and operator valued functions 
-commutative and noncommutative shuffle products-, summation operations -commutative and 
noncommutative quasi-shume products-, differential operators and integration methods -pre-Lie 
products. 

2.2. Spitzer's Identity. We consider now the case of non-zero weight 9 7^ 0. Note that if R is a 
Rota-Baxter map of weight 9, then the map R' := (3R for /3 £ k is of weight f39. This permits to 
rescale the original weight 9 ^ Q to 9 = +1 (or 9 = —1). There exist several natural examples of 
zero and non-zero weight Rota-Baxter algebras: 
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• In classical fluctuation theory, where extrema of sequences of real valued random vari- 
ables play a crucial role, the operator R is defined on the characteristic function F(t) := 
E[exp(itX)] of a real valued random variable X: 

R(F)(t) :=E[exp(itX + )], 

where X + := max(0,X). One can show that R is a Rota-Baxter map of weight 8 = — 1. 
Note that one may call it the "historical" example, which motivated Spitzers' and hence 
Baxter's original works [6| 13? ] 1331 150] . 

• Let A be an associative algebra (not necessarily commutative) which decomposes into a 
direct sum of two (non-unital) subalgebras, A = A + © A~ . The two orthogonal projectors 
7r± : A — > A are Rota-Baxter maps of weight 9 = — 1. In fact, any projector which satisfies 
relation (JTJ) is of weight 9 = — 1. The dominant example for such an algebra are the Laurent 
series C[e , e]], which decompose into a "divergent" part e _1 C[e _1 ] and a "regular" part 
C[[e]]. Further below we will see its role in the modern approach to renormalization theory 
in perturbative quantum field theory. A nice example of a noncommutative Rota-Baxter 
algebra are n x n matrices together with the projector that maps a matrix a to the upper 
triangular matrix a u defined by replacing all entries below the main diagonal by zeros. 

• From the point of view of analysis and for the theory of discrete dynamical systems, the 
fundamental example of Rota-Baxter maps are summation operators. On functions / 
defined on N and with values in an associative algebra, we define the summation operator 
R(f)(n) := X^fc=o f(k)-> wmc h is a Rota-Baxter map of weight 9 = 1. Note that it is the 
right inverse of the finite difference operator A(/)(n) := f(n + 1) — f(n). 

Interest in Rota-Baxter algebras goes beyond its purely algebraic setting. Indeed, their relevance 
mainly stem from the possibility to describe interesting and universal combinatorial identities, which 
are useful, for instance, in the diverse set of examples ranging from integration and summation 
operators to projectors. One of canonical examples of such an universal combinatorial result is 
known as Spitzer's classical identity. 

Let us return to the commutative case. Spitzer's classical formula allows to calculate the char- 
acteristic function of the extrema of a discrete process Si := X\ + • • • + Xi, defined as a sequence of 
partial sums of a sequence of independent and identically distributed real valued random variables 
X{. In other words, we consider the new sequence of random variables Yi := sup(0, Si, S2, • • • , Si). 
Spitzer's formula permits to calculate the characteristic function in terms of the positive part of the 
partial sum, denoted S^~ . Let F be the characteristic function of X. In the setting of Rota-Baxter 
algebra we find: 

E[exp(itY k )} = R(F ■ R(F ■ ■■■ R(F ■ R(F)) ■■■))=: R {k) (F), 

and: 

E[exp(itS k )] = R(F k ). 

From this we deduce an identity which is true in every commutative Rota-Baxter algebra of any 
weight 9, namely: 



(7) io g (i+E^))-4-E ^ 



n>0 n>0 
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Let Ofl(-F) denote the argument of the map R on the right hand side of ([7]). It is obvious that 
fl'JF) = O^ 1 log(l + OF). Moreover, the following identity can be derived: 

e ev — 1 n >o 

Here £ is the left multiplication operator, £ x (y) := xy, and B n are the Bernoulli numbers, which 
appear in the series expansion of J^ . Later we will see how this rewriting, which appears to 
be somewhat pointless in the commutative case, allows for a straightforward generalization of the 
above identity to noncommutative Rota-Baxter algebras. The key is Magnus' seminal work on the 
solution of linear differential equations j39j (which appeared in 1954, only two years before Spitzer's 
important paper). Magnus pioneered the problem of calculating the logarithm of the solution of a 
linear initial value problem written as a time-ordered exponential. We refer the reader to [7] for a 
comprehensive review. 

Deeper into the combinatorial nature of commutative Rota-Baxter algebras goes the Bohnenblust- 
Spitzer formula, which is a multilinear identity similar to Spitzer's identity 

(9) 5X-- Wa))^) ■■■)K (n) = e ^'"ir & - ^ (n f /)> 

where S n is the symmetric group of order n. On the right hand side V n denotes set partitions of [n], 
and |vr | is the number of blocks of the partition n € V n . The size of the ith block 7r, of ir is denoted 

bi. Note that 1 refers to the double product in ([2]). The weight zero case, 9 = 0, simplifies to: 

n 
/] R {- ■ ■ R( R (Kill)) F a(2)) ■■■) F <r(n) = I| F i- 

With the goal to generalize identity ([9]) to noncommutative Rota-Baxter algebras, we again 
propose a slightly more involved rewriting. Recall the canonical cycle decomposition c\ ■ ■ ■ Ck of 
a permutation a € S n . Each cycle starts with its maximal element, and the cycles are written 
in increasing order of their first entries. For example, (32) (541) (6) (87) is such canonical cycle 
decomposition. The jth cycle is denoted Cj = (aj aj 1 . . . a,j m ._ x ), where rrij is the size of this cycle 
and a j > a h ji < I < j mj -i- 

Identity Q rewrites: 

(10) ^^(•••fl( J R(F ff(1) )F CT(2) )...)F ff(n) = E^(F!,...,F n ), 

a£Sn (TdSn 

where for each permutation a written in its canonical cycle decomposition c\ ■ ■ ■ Cf~ we define: 

k m j-l 

(11) V 9 a ( Fl ,...,F n ) := ]X 9 (( II roF.JiF^j). 

j=l i=l 

Now the operator r denotes right multiplication r x (y) := yx. Note that the second product on the 
right hand side is with respect to the composition of these multiplication operators. For instance, 
for the permutation a := (43) (512) G S§ we obtain: 

V e a {F x , . . . ,F 5 ) = r eF3 (F 4 ) * e (r eF2 r eFl )(F 5 ) = 3 (F 4 F 3 ) * e (F 5 F X F 2 ). 

In the commutative setting, the product (IL=i r 0F a . )(F a . ) is independent of the order of the 
Oji> i > 1. Therefore, several of the terms T> e a (F\, . . . , F n ) on the right hand side of (|10p may 
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coincide. The resulting coefficients II^iC^j ~~ -*-)' l ea, d to the compact form ([9]) written in terms 
of set partitions. Indeed, choose a block ttj, say, of size rrij, of a partition ir. Its first entry is fixed 
to be its maximal element. This block then corresponds to (rrij — 1)! different cycles of size rrij. 

2.3. Combinatorial Approach: Cartier and Rota. The main idea underlying the works of 
Cartier and Rota is to deduce such Spitzer-type identities in the context of the free commutative 
Rota-Baxter algebra constructed over an alphabet X. Generally speaking, the universal property 
of free algebras then implies that such formulae are automatically valid in any, that is, in the 
category of such algebras. However, we will see that the two approaches chosen by Cartier and 
Rota are rather complementary. 

Cartier's construction [10] of a free commutative Rota-Baxter algebra is a forerunner of the 
modern notion of the quasi-shuffle algebra (see [31]). In [33] Cartier's construction was made more 
explicit. Recall that the quasi-shuffle product is very present in the context of the theory of multiple 
zeta values (MZVs) and other generalizations of special functions (see |llj). 

Suppose we start with an alphabet X equipped with a commutative monoid structure. We denote 
the monoid composition additively. Letters and words are elements of X, respectively sequences 
of letters. Shuffling of two words means to arrange their letters consecutively in such a way that 
the relative orders are preserved. Summing over all possible arrangements preserving the relative 
orders gives rise to the shuffle product. For instance, the word 212425222523 is part of the shuffle 
product of the two words 212223 and 24X5X6- The quasi-shuffle product is based on the shuffle 
product, but it allows moreover for an additional operation using the underlying monoid structure 
of X in the following way. In the monomials resulting from the shuffle product of two words any two 
consecutive letters may be summed provided that they are from different words. For example, the 
word X1X4X5X2X6X3 is included in the quasi-shuffle product of the words x\X2Xj,w\A 2425X6 (i-e., no 
internal summation), but also the words x\x^y\y2 and ^1X5X22/2, where y\ := X5 + X2, 2/2 := x% + ^3 
and z\ := x\ + 24. 

A concise mathematical description of the quasi-shuffle product of two words is given in terms 
of its recursive definition [31] . Here we will present it in the context of Rota-Baxter algebra, since 
this makes the link between these two notions more transparent. Moreover, it indicates why the 
quasi-shuffle product appears to be naturally related to summation operators, such as for instance 
in the theory of MZVs, where the identity: 

C(p)C(«) = tip, q) + C(q,p) + C(p + q) 

follows from the sum-representation: 

C(^ : =E^' C(p,<r):= E ^- 

n>0 m>n>0 

The idea is similar to the weight zero case (i.e. shuffles). Let us assume that 8 = 1. The 
Rota-Baxter double product (0) decomposes into three terms: 

x *i y = R{x)y + xR(y) + xy. 

The first two terms correspond to the aforementioned "half-shuffles". The last part, xy, is just 
the original commutative and associative algebra product. Using the notations x f y := xR(y), 
x 1 y := R(x)y and x • y := xy yields: 

(12) xiy = ytx, (2 t y) t z = x\ (y t z + z\ y + y ■ z). 

When compared to the ordinary shuffle product (weight zero), the last term, y ■ z, explains the 
presence of additional terms in the quasi-shuffle product. This becomes already obvious at lower 
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degrees when compared with the shuffle formula (HJ). In the following example we separate the 
parts coming from "pure" shuffling using brackets from the additional components that result from 
the weight 6 = 1 terms. 

R(x)R(y) = [R(xR(y)) + R(yR(x))} + [R(xy)), 

R(x)R(yR(z)) = [R(xR{yR(z))) + R(yR{xR{z))) + R(yR(zR{x)))] 

+ [R(xyR{z)) + R{yR{xz))]. 

More general formulae for larger products follow from a recursive application of (p}. Note, however, 
that these identities are valid only in the commutative case. 

Rota's approach |44| [45] is rather different. It is based on the observation that for any com- 
mutative algebra A, the algebra A of functions from the non-negative integers into A, that is, 
sequences of elements of A with pointwise product, is naturally equipped with a Rota-Baxter 
algebra structure defined by the operator: 

R(ai,... ,On, •••) = (0, ai,ai + 02, . . . ,a\ -\ \-a n ,...). 

Indeed, let A be the algebra k[xi, . . . , x n , . . .] of polynomials in an infinite number of indeterminates. 
One can then show that the Rota-Baxter subalgebra generated by the sequence x := (xi, . . . , x n , . . .) 
is free. This establishes an immediate link to the well-understood classical theory of symmetric func- 
tions. In fact, the terms in Spitzer's classical identity, such as R{x n ) and R^ n '(x) = R(R^ n ~ 1 ' (x)x) , 
correspond in the theory of symmetric functions to power sums (^^ xf) and elementary symmetric 
functions (^ <...<$ x h '" x i n )i respectively. From this one concludes that Spitzer's identity fol- 
lows from Waring's formula, which expresses power sums as polynomials in elementary symmetric 
functions. 

3. Theory of Noncommutative Rota-Baxter Algebras 

Thanks to the particular way we presented the commutative case we do not have to dwell too 
much on the details of the theory of noncommutative Rota-Baxter algebras. Instead, we will try 
to sketch the techniques as well as theoretical ingredients, that permit a simple transition from the 
commutative to the noncommutative realm. 

In spite of multiple good reasons, theoretical as well as applied ones (e.g., integration and finite 
difference calculus with respect to operator algebra valued functions), interest in noncommutative 
Rota-Baxter algebra has been rather sporadic, compared to the amount of work that erupted shortly 
after Baxter's paper came out. In fact, only in the last 10 years or so the theory of noncommutative 
Rota-Baxter algebra has been developed systematically. 

Key in this renewed interest in the theory of noncommutative Rota-Baxter algebras is its link 
to renormalization in perturbative quantum field theory. Recall that the renormalization program 
permits to give a sense to integrals appearing in perturbative calculations in quantum field theory, 
which are otherwise divergent. The main idea is to extract and eliminate those parts that cause the 
divergencies in a coherent way, which is moreover compatible with the underlying physics |13[ [16] . 
This extraction and elimination, i.e., the renormalization process, is combinatorial in nature. It 
was essentially described by Bogoliubov and others, and has been known in physics for a long time 
(in fact, Bogoliubov's and collaborators work |8] appeared around the same time as Magnus' and 
Spitzer's work). See [9] for one of the earliest textbooks on the subject. Renormalization theory 
plays a fundamental role in the standard model, and the spectacular results in particle physics that 
followed from it. Let us remark that from a mathematical point of view, despite the fact that other 
approaches to the renormalization problem, like, for instance, the Epstein-Glaser method [27] or 
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Wilson's renormalization group [52], allow for both a comprehensive understanding of the origin of 
divergencies as well as the very nature of the renormalization program, Bogoliubov's subtraction 
scheme (better known as BPHZ renormalization method [16] ) still carries a veil of mystery. 

In the late 1990s, Kreimer, and Connes and Kreimer came up with a rather unexpected approach 
to the BPHZ renormalization method in terms of so-called combinatorial Hopf algebras |17[ [35] . 
In their work on the recursive subtraction algorithm of Bogoliubov in the context of dimensional 
regularization together with minimal subtraction, they needed the subtraction scheme map to 
satisfy a particular set of identities; it was pointed out by C. Brouder that these identities could be 
subsumed into a single relation, which was henceforth called multiplicativity constraint [36]. A few 
years later it was realized that the identity proposed by Brouder coincided with the Rota-Baxter 
identity. Subsequent work of several people clarified the role of the Rota-Baxter relation in the 
aforementioned work of Connes and Kreimer. We refer the reader interested in details to Manchon's 
article in La Gazette des mathematiciens [3D] , where he gives a concise and elegant account of the 
algebraic structures involved in the Connes-Kreimer theory, including the Rota-Baxter algebra 
aspect. 

For what considers this presentation, two aspects are of crucial importance. On the one hand, 
Bogoliubov's recursive algorithm and the Birkhoff decomposition derived from this by Connes and 
Kreimer are an instance of a general decomposition principle valid in any Rota-Baxter algebra, and 
known as Atkinson's decomposition. On the other hand, the work of Connes and Kreimer triggered 
considerable research into what is nowadays known as the theory of combinatorial Hopf algebras as 
well as into the theory of pre-Lie algebras. This lead also to a renewed interest in noncommutative 
Rota-Baxter algebras. 

The rest of this work is organized as follows. In the following section we recall Atkinson's decom- 
position for Rota-Baxter algebras. The generalization of Spitzer-type formulas to noncommutative 
Rota-Baxter algebras is presented in Section [3.21 In this context, the notion of pre-Lie algebra is 
central. In Section 13.31 we describe the noncommutative analogs of Cartier's and Rota's construc- 
tions. We indicate how for noncommutative Rota-Baxter algebras the theory of noncommutative 
symmetric functions plays a role analogue to that of symmetric functions for commutative Rota- 
Baxter algebras described by Rota. By using either examples or references to other works, we end 
this article with a brief outline of possible applications of Rota-Baxter algebras - note that this 
last part mainly reflects the taste and interests of the authors. 



3.1. Atkinson's Factorization. In 1963, based on Baxter's and Spitzer's works [50], F. Atkinson 
immersed himself into the factorization problem of Rota-Baxter algebras [lj. However, he did not 
limit himself to commutative algebras. 

Let A be a Rota-Baxter algebra of weight 9. We denote by R the operator —6\&a—R- It is easy to 
show that it is a Rota-Baxter operator of weight 6. Moreover, the identity R{x *g y) = —R(x)R(y) 
holds. Note that for = 0, we have R = —R. For x € A, Atkinson as well as Baxter were interested 
in the equations: 

f = l + XR(fx), h = l + XR{xh). 

The formal parameter A is introduced to circumvent any discussions regarding convergence or 
invertibilty issues. The solutions of the two equations are given in terms of the argument of the 
logarithm of the left hand side of the Spitzer identity (J7J) for / (here F = Xx), and an analogue 
series expansion for h. 

The identity R(a)R(b) = R(aR(b)^ -\-R(R(a)b) implies one of Atkinson's central results, i.e., the 
identity fh = 1 — XOfxh in A[[X\] which yields: 

(13) l + \0x = f- 1 hT x . 
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Let us for a moment return to renormalization in the perturbative quantum field theory. We 
limit ourself to the algebraic perspective (without going into the details, for which we refer the 
reader to Manchon's paper |40] and Collins' classical monograph |16|). The operator R is called 
the subtraction scheme map and is defined analogous to the second example in Section 12.21 it is a 
projector into a subalgebra which, so to say, contains the divergences. The map R is therefore an 
idempotent Rota-Baxter map of weight 9 = — 1; it verifies moreover R(l) = 0. We now omit the 
parameter A. The term 1 — x in Atkinson's identity (|13|) is the object to be renormalized, i.e., a 
series of divergent integrals. The quantity / encodes the divergences. Classically it is denoted C and 
called counterterm in renormalization theory. The quantity h~ l is the result of the renormalization 
process, and consists of a series of terms each of which is finite. Note that we neglect deliberately 
any aspects related to questions of summability, which is another problem, where simple algebra 
may not be enough to say much. 

Bogoliubov's renormalization method is a recursive process, which permits to calculate /i _1 from 
x. From a Rota-Baxter algebra point of view, it is just an example of Atkinson's decomposition. 
From /(l — x) = /i -1 we deduce that: 

/ = 1 + R(f) = 1 + R(f- h- 1 ) = 1 + R(fx), 

and: 

hT 1 = R(h^) = R(f(l - x )) = l- R(fx). 

In fact, h~ l is in the image of the orthogonal projector R, and / — 1 is in the image of R. Therefore, 
we have R(h~ l ) = and R(f) = 1. Using the underlying series expansions (and, more precisely, 
the grading of the terms in these series), one shows that the two equations are coupled: 

h- 1 = 1 - R(fx), f = l + R(fx). 

Bogoliubov's algorithm then amounts to solving this system recursively degree by degree (here 
we indicate the degree of the terms in each series by an index j, such that for j = 1,2 we find 
h^ 1 = -R(xi), /i = R(xi), h^ 1 = -R{f\X\ + x 2 ), h = R(flXi + x 2 ), etc.). 

3.2. Noncommutative Spitzer Identity. One of the natural questions is then to understand 
the expression on the left hand side of Spitzer 's identity, i.e., the logarithm of the solution of the 
equation / = 1 + R(fx). 

Notice that, depending on the algebra A and operator R under consideration, one may either 
view this solution as the counterterm in renormalization theory, or, to mention two more familiar 
examples, the solution of a 1st order linear differential equation in a matrix algebra, or the linear 
fixed point equation of a discrete dynamical system. In particular, in view of the penultimate 
example, one would expect that a "noncommutative Spitzer formula" should solve, among others, 
the Baker-Campbell-Hausdorff problem (to compute the logarithm of the solution of a 1st order 
linear differential equation). We will see that this is indeed the case. 

Key in the theory of noncommutative Rota-Baxter algebras is the existence of another algebraic 
structure beside the double product ([2]). Indeed, any Rota-Baxter algebra comes equipped with an 
underlying pre-Lie (or Vinberg) algebra structure [HI [TH Hi] . Recall that a (left) pre-Lie algebra 
is a vector space endowed with a (non-associative) product, which verifies an identity weaker than 
associativity, and known as (left) pre-Lie relation: 

(14) (x > y) > z - x > (y \> z) = (y > x) \> z — y \> (x > z). 

Right pre-Lie algebras are defined similarly. Pre-Lie algebras are Lie admissible, i.e., the bracket 
[x, y]> := x > y — y > x satisfies the Jacobi identity: 

[x, [y, z]>]> + [y, [z, a;]>]> + [z, [x, y]>]> = 0. 
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The canonical example of a pre-Lie algebra, which can be traced back to Caley's work on rooted 
trees, is given in terms of derivations. One verifies for example that the space of vector fields 
generated by the derivations x n d x is a pre-Lie algebra with product {x n d x ) > (x m d x ) := m ■ 



x- 



Any Rota-Baxter algebra (of weight 0) inherits automatically a (left) pre-Lie structure, which 
is defined similarly to the double product: 

(15) at> e b:= R(a)b - bR(a) - Oba. 

In fact, both left and right pre-Lie products can be defined. We will focus on the former product, 
since ag<\b := —bt>Qa is right pre-Lie. Note that the Lie brackets associated to [># and *g identify: 
[a, 6]|> e = [a, 6]* e . For = 0, we have a \>o b = [R(a),b] := R(a)b — bR(a). Obviously, in the 
commutative case the pre-Lie product simplifies to the original product of the algebra (up to a 
scaling by the weight 0). 

To understand the role played by the pre-Lie product we return to the Bohnenblust-Spitzer 
formula, using the description presented in (|10p . which appeared artificial in the commutative case. 
For n = 2 we see quickly that: 

Fy * e F 2 - 6F 2 F 1 = R(F 1 )F 2 + R(F 2 )F 1 

provided that the Rota-Baxter algebra is commutative. In the noncommutative case the slightly 
less obvious identity holds: 

Fi * e F 2 + F 2 > e F x = R(F 1 )F 2 + R(F 2 )F 1 . 

This simple procedure of replacing the algebra product by the pre-Lie product generalizes to all 
orders. Indeed, it is "sufficient" to redefine in (fT0|) the operator T> e a by substituting in definition (fTTj) 
the operator r$ x (right product by Ox) with the operator r >gx , again a right multiplication operator, 
but defined now in terms of the pre-Lie product, r >gX (y) := y \>g x. 

Using the example following the definition (jlip , we substitute in the Bohnenblust-Spitzer formula 
the expression: 

vi(F lt . . . ,F 5 ) = 3 (F 4 F 3 ) * e (F 5 F,F 2 ), 
by the one defined in terms of the pre-Lie product: 

2>>»(Fi, . . . ,F 5 ) = r >eF3 (F 4 ) * e (r >f)F2 r >gFl )(F 5 ) 

= ^ > e F 3 ) * e ((F 5 >e F ± ) > e F 2 ). 

With this redefinition, the Bohnenblust-Spitzer identity (|10p holds in any noncommutative Rota- 
Baxter algebra of weight [23] . The identity has even a g-analogue [52] . 

To approach the Baker-Campbell-Hausdorff problem generalized to Rota-Baxter algebra, it is 
useful to recall Spitzer's classical formula and its rewriting in terms of the generating series of the 
Bernoulli numbers (|8|), and to compare it to Magnus classical solution of the problem. In his seminal 
1954 paper [39], Magnus considered the (say, matrix valued) solution Y(t) of the linear differential 
equation Y(t) = A(t)Y(t), Y(0) = 1. He showed that the logarithm Q(A)(t) := log(Y(i)) is the 
solution of the differential equation: 

where ad is the ordinary adjoint action (ad x (y) = [x,y] = xy — yx). We remark that, since 
£1 (A)(0) = 0, the fact that the indefinite Riemann integral is a weight zero Rota-Baxter map, 
implies that: 

ad n(A) (A) = {(1(A), A\ = (1(A) o d A = l^ A>a (A), 



m = d^rW. 
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where £ x>0 {y) := x > V- 

The generalization to arbitrary Rota-Baxter algebras unifies Spitzer's classical formula with 
Magnus' expansion thanks to the pre-Lie product. Let A be a Rota-Baxter algebra of weight 
6. If / is a solution of / = 1 + XR(fx) in -A[[A]], then the element Q'(Xx) defined such that 
R{n'(Xx)) = log(/) satisfies [25]: 

< i6 > n '^ = ^:w ^ = ^e^^jh 

where £ x > e (y) '■= x \>q y. This series is called pre-Lie Magnus expansion. Here, the prime notation 
shall remind the reader of Magnus' original differential equation. As a remark we mention that 
a similar approach applies to Fer's expansion |25j . F. Chapoton showed in [15] that (|16p is of 
significant interest in the context of the theory of Lie idempotents as well as the theory of operads. 
Let us make the first few terms of Cl'(Xx) explicit: 

Q'(Xx) = Xx + -A 2 x >o x + -:A 3 (x >0 x) > e x + — A 3 x \> e (x \> e x) + 

-A 4 -((x >g X) >g X)> e x- A 4 — (0 \>g (x \>g x)) >g X 



-X \>g ((x \>g x) \>Q X) + (x \>Q X) \>Q (x \>0 x) J 



+ 



Observe that using ([Hj) the four order A 4 terms reduce to: 

--((X \>0 x) \> e x) > e x - —x \> e ((x \> e x) \> 9 x). 

3.3. The word problem. We have seen that the case of weight zero Rota-Baxter algebras 
is representative for the classical theory of integration (and differential) calculus. The pre-Lie 
product reduces to the ordinary Lie bracket twisted by the indefinite Riemann integral map, 
a >o b — R(a)b — bR(a) =: [R(a),b]. Hence, Magnus classical expansion seen as the weight 
9 = noncommutative analogue of Spitzer's formula clearly emphasizes the key role played by the 
pre-Lie structure in noncommutative Rota-Baxter algebras. Moreover, the simple generalization 
to the non-zero weight case allows for a coherent and natural passage to the full noncommutative 
analogue of Spitzer's formula. 

The classical word problem leads to another class of fundamental ideas and structures related 
Rota-Baxter algebras. We have seen that the noncommutative analogue of Cartier's theory is 
intimately related to noncommutative analogues of shuffle and quasi-shuffle algebras. A natural 
way to represent these structures is in terms of graphical, i.e., combinatorial objects, such as planar 
rooted trees [31 [21] . 

On the other hand, the description of the noncommutative analogue of Rota's classical solu- 
tion to the word problem [231 [23] is surprisingly straightforward. One simply replaces in Rota's 
original work the algebra of polynomials in an infinite number variables k[x\, . . . ,x n , . . .] by its 
associative analogue, i.e., the free associative algebra (or tensor algebra) over the alphabet X := 
{xi, . . . , x n , . . .}. Rota's results continue to hold in this noncommutative setting, that is, the oper- 
ator R(xi, . . . , x n , ...):= (0, xi, xi + X2, . . . , xi + • • • + x n , . . .) still is a Rota-Baxter map, and the 
Rota-Baxter subalgebra generated by the sequence x = (xi, . . . , x n , . . .) gives a presentation of the 
free Rota-Baxter algebra over one generator x. 

Beside the fact that this gives a simple answer to the word problem, it generalizes Rota's sem- 
inal insight, unveiling the link between free commutative Rota-Baxter algebras and symmetric 
functions, to one of the fundamental notions in modern algebraic combinatorics, i.e., the theory 
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of noncommutative symmetric functions -the latter has been developed in the last 20 years by 
J.-Y. Thibon and his collaborators (G. Duchamp, F. Hivert, J.-C. Novelli inter alia) in a series of 
important articles from [30] to [20] , 

Indeed, recall that, if we calculate the terms on the left hand side of Spitzer's classical iden- 
tity in the commutative case, i.e., RS n '(x) = R(R^ n ^ 1 '(x)x), using Rota's presentation of the free 
Rota-Baxter algebra on one generator, then we find a sequence of elementary symmetric functions 
Ylo<i <■■■<« <k x h ' ' ' x in "this is the basis for Rota's proof of the Spitzer formula, as a corollary of 
Waring's identity. In the noncommutative case the same observation holds, but this time noncom- 
mutative variables enter the picture. As a result F. Hivert's theory of quasi-symmetric functions 
in noncommutative variables applies. This approach shares the same advantages with Rota's orig- 
inal work. Indeed, it allows to apply a whole range of results and techniques from the theory of 
noncommutative symmetric functions (and variants of it, such as Solomon's descent algebra, which 
plays a crucial role in the theory of free Lie algebras [43]), see e.g. [23], [24] . 

One may summarize this by the remark that this approach points in the direction of far reaching 
generalizations of Spitzer's formula beyond those which we have just sketched. This is of course 
not surprising, as it is coherent with the general observation that passage from the commutative 
to the noncommutative realm in general is non-unique. The theory of noncommutative symmetric 
functions [30] provides an illustrative example for this. 

3.4. Applications and Perspectives. We have seen that the theory of Rota-Baxter algebra nat- 
urally provides the setting to work abstractly with common notions such as indefinite integrals (or 
dually derivations), summation operators, splitting of algebras. The range of possible applications 
is therefore numerous. We limit ourself to mention only two examples, one in the direction of 
differential equations, the other with respect to universal algebra. 

3.4.1. Control Theory. Control theory of differential equations has been for a long time associated 
with the combinatorics of iterated integrals, and its formal variant, the combinatorics of words and 
shuffles. M. Fliess' works |28] play an eminent role in these developments at this interface between 
combinatorics and analysis. 

From the point of view of integration (recall that the indefinite Riemann integral is Rota-Baxter 
of weight zero), the underlying combinatorics is the one of weight zero Rota-Baxter algebras. In 
one dimension the commutative case can be used. Beyond that, i.e., in the case of operator valued 
functions, noncommutativitiy enters the picture. 

It is therefore not too surprising to discover certain ideas from noncommutative Rota-Baxter 
algebras in modern control theory. Indeed, chronological calculus is based on the notion of chrono- 
logical algebra, which is the (weight zero) pre-Lie structure that derives from the Riemann integral 
in noncommutative algebras of matrix or operator valued functions. A similar remark applies also 
in the context of theory of numerical methods for differential equations. 

The work of A. Agrachev and R. Gamkrelidze serves as a remarkable example [I] HI]. These 
authors systematically developed the theory of free chronological algebras, i.e., free pre-Lie algebras. 
This work allows for example to understand the relationship between the pre-Lie Magnus expansion 
and the discrete Baker-Campbell-Hausdorff formula (which allows to calculate log(exp(x) exp(y)) 
when x and y do not commute: 

1 1 

BCH(x, y):=x + y + -[x,y} + —([x, [x, y}\ + [y, [y, x}}) + ■■■) 

in the context of Rota-Baxter algebra. Using their construction of the group of flows one can show, 
for a general Rota-Baxter algebra, that the product I = fh of solutions / and h of the fixed point 
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equations / = 1 + R(fx) and h = l + XR(hy), respectively, solves the equation I = 1 + R(lz), where: 

z = x»y:=x + e 9n '(. x )>ey. 

Here Ci'(x • y) = BCH* g (Q'(x), Q,'(y)), and BCH* g stands for the Baker-Campbell-Hausdorff 
formula defined in terms of the Lie bracket following from the Rota-Baxter double product *g. 

3.4.2. Algebra Splitting and Yang-Baxter Equations. We have seen how in an associative algebra 
Rota-Baxter maps permit to define new algebraic structures. In the following we restrict ourself 
to the weight zero case. However, the reader should have no problem to generalize the results to 
arbitrary weight. 

Let A be a weight zero Rota-Baxter algebra. The half-shuffles x f V '■= xR{y) and x 1 y := R(x)y 
combine to an abstract shuffle-like product on A. In the commutative case, one verifies that: 

(17) a J, b = 6 t a, a I (b I c) = (a I b + b J, a) I c. 

In the noncommutative case we do not longer have a I b = b f a, and the half-shuffles satisfy a 
system of axioms that characterizes noncommutative half-shuffles. In modern terms one says that 
A is equipped with a dendriform algebra structure defined in terms of the compositions 1 and t 
[21I1Q]: 

(18) (atk)tc = at(Hc + Hc) 

(19) ol(6tc) = (o4-b)tc 

(20) a I (b I c) = (a f 6 + a 1 6) j c. 

M. Aguiar pointed out in [2] that this splitting principle should not be limited to associative 
algebras -an idea that appears to be important from the point of view of universal algebra. In the 
same article, he established a link between weight zero Rota-Baxter operators and the equation: 

(21) r 13 r 12 - r 12 r 2 3 + ^23^12 = 0, 

known as the associative analog of the classical Yang-Baxter equation. He proved that if in an 
associative algebra A, the element r = Yl u i ® v i € A (g> A satisfies (f2T|) . then the map R r (x) := 
^2 UiXVi is Rota-Baxter of weight zero (recall that r\% is defined as ^ U{ <8> vi <8> 1 € A <8> A <8> A, and 
analogously for the other r^). 

Next we will show that there exists another link between Rota-Baxter operators and the Yang- 
Baxter equation. But before that, we look at the half-shuffle products t and L Let V be a vector 
space equipped with a product * (a bilinear map with values in V, for the time being we do not 
assume that it has particular properties). We can then define the notion of a (weight 0) Rota-Baxter 
algebra operator R on V for the * product by relation: 

R(x) * R(y) = R(R(x) *y + x* R(y)). 

When the product satisfies relations (e.g. associativity, commutativity, Jacobi identity, pre-Lie or 
shuffle identities...), one would expect the "half-products" x f y = x * R{y) and x \, y = R(x) * y 
to inherit remarkable properties from those of *. We know already that this is the case when * is 
associative or commutative, but this is true for a wider class of theories. We refer to [22(, [51 151] . 
where these phenomena are studied in detail and limit here our attention to the particular case of 
Lie algebras, that fits nicely in the framework of the present article. 

Let L be a Lie algebra equipped with a Rota-Baxter map R of weight zero. Hence, R satisfies: 

(22) [R(x), R(y)} = R([R(x), y] + [x, R(y)}), 

which is the -operator form of- classical Yang-Baxter equation. This is enough for the bracket 
[x,y]n := ([R(x),y] + [x,R(y)]) to be a Lie bracket [49]. Analogous to the associative case, this 
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"double" Lie bracket [— , —]r is the addition of the products x ^ y := [x, R(y)] and x 4- y := [i?(x), y]: 
[x, y]# = xty + ^|y- However, the two products t and I are right and left pre-Lie, respectively 
and the Lie bracket [—,—]# can be written: 

[x,y)R = x J [y -y^ x. 

In the case of a Rota-Baxter algebra A of arbitrary weight, the corresponding Lie algebra At, is 
a Rota-Baxer Lie algebra for the map R. We recover in that case the same algebraic structures 
as described further above (that is, the pre-Lie structures arising from the associative and Lie 
Rota-Baxter structures identify). 

Let us return to the general case of an arbitrary weight 8. We can go a bit further in the 
direction of Yang-Baxter equations and their link to Rota-Baxter algebras. The classical Yang- 
Baxter equation admits another variant, which plays an important role in the seminal work of 
Semenov-Tian-Shansky |49] , He introduced the modified Yang-Baxter equation: 

(23) [B(x),B(y)] = B([B(x), y] + [x, B{y)\) - 9 2 [x, y\. 

It is sufficient to guarantee that the bracket [x,y]_B := ^([B(x),y] + [x,B(y)]) is a Lie bracket. One 
verifies quickly that in an associative Rota-Baxter algebra A of weight 9, the associative analogue: 

(24) B(x)B(y) = B{B(x)y + xB(y)) - 6 2 xy, 

is satisfied by the operator B := R — R. As well, the operator B satisfies in Al the modified Yang- 
Baxter equation. Using Semenov-Tian-Shansky's terminology, the map B defines an associative 
double structure on A. We finally remark that the Rota-Baxter double product ([2]) rewrites: 
x *e y = \{B{x)y + xB(y)). 

These various results and examples presented in this article do certainly not give an exhaustive 
picture of the (existing or forthcoming) application domains of Rota-Baxter algebra techniques. 
However, we believe they render the flavor of the theory, and its power to attack many different 
problems, and contribute strongly to the development of ideas that are relevant both to the general 
theory of algebraic structures and to applications. 
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